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Extracting understanding from the growing "sea" of biological and 
socio-economic data is one of the most pressing scientific challenges 
facing us. Here, we introduce and validate an unsupervised method 
that is able to accurately extract the hierarchical organization of 
complex biological, social, and technological networks. We define 
an ensemble of hierarchically nested random graphs, which we use 
to validate the method. We then apply our method to real-world 
networks, including the air-transportation network, an electronic cir- 
cuit, an email exchange network, and metabolic networks. We find 
that our method enables us to obtain an accurate multi-scale de- 
scriptions of a complex system. 

complex networks | hierarchical organization | multi-scale representation | cel- 
lular metabolism | transportation networks 

The high-throughput methods available for probing bio- 
logical samples have drastically increased our ability to 
gather comprehensive molecular-level information on an ever 
growing number of organisms. These data show that these 
systems are connected through a dense network of nonlinear 
interactions among its components [TJ [2], and that this in- 
terconnectedness is responsible for their efficiency and adapt- 
ability. At the same time, however, such interconnectedness 
poses significant challenges to researchers trying to interpret 
empirical data and to extract the "systems biology" princi- 
ples that will enable us to build new theories and to make 
new predictions [3]. 

A central idea in biology is that life processes are hierarchi- 
cally organized 2 4, 5, 6 and that this hierarchical structure 
plays an important role in their dynamics [?]. However, given 
a set of genes, proteins, or metabolites and their interactions, 
we still do not have an objective manner to assess whether 
such hierarchical organization does indeed exist, or to objec- 
tively identify the different levels in the hierarchy. 

Here, we report a new method that identifies the levels 
in the organization of complex systems and extracts the rele- 
vant information at each level. To illustrate the potetial of 
our method, it is useful to think of electronic maps as in 
://maps. google. com (Fig. SI). Electronic maps are powerful 
tools because they present information in a scalable manner, 
that is, despite the increase in the amount of information as 
we "zoom out," the representation is able to extract the in- 
formation that is relevant at the new scale. In a similar spirit, 
our method will enable researchers to characterize each scale 
with the relevant information at that scale. This achievement 
is key for the development of systems biology, but will en- 
counter application in many other areas. 

Background 

Complex networks are convenient representations of the in- 
teractions within complex systems [8]. Here, we focus on the 
identification of inclusion hierarchies in complex neworks, that 
is, to the unraveling of the nested organization of the nodes in 
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a network into modules, which are comprised of sub-modules 
and so orQ 

A method for the identification of the hierarchical orga- 
nization of nodes in a network must fulfill two requirements: 
(i) it must be accurate for many types of networks, and (ii) 
it must identify the different levels in the hierarchy as well as 
the number of modules and their composition at each level. 
The first condition may appear as trivial, but we make it ex- 
plicit to exclude algorithms that only work for a particular 
network or family of networks, but that will otherwise fail. 
The second condition is more restrictive, as it excludes meth- 
ods whose output is subject to interpretation. Specifically, 
a method does not fulfill the second condition if it organizes 
nodes into a tree structure, but it is up to the researcher to 
find a "sensible" criterion to establish which are the different 
levels in that tree. An implication of the previous two re- 
quirements is that any method for the identification of node 
organization must have a null output for networks, such as 
Erdos-Renyi random graphs, which do not have an internal 
structure. 

To our knowledge, there is no procedure that enables one 
to simultaneously assess whether a network is organized in a 
hierarchical fashion and to identify the different levels in the 
hierarchy in an unsupervised way. Ravasz et al. 12 studied 
the hierarchical structure of metabolic networks, but in their 
analysis the authors put emphasis on detecting "global sig- 
natures" of a hierarchical network architecture. Specifically, 
they reported that, for the metabolic networks studied and for 
certain hierarchical network models, the clustering coefficient 
of nodes appears to scale with the connectivity as C(k) ~ fe _1 . 
This scaling, however, is neither a necessary nor a sufficient 
condition for a network to be hierarchical |13] . 

More direct methods to investigate the hierarchical or- 
ganization of the nodes in a network have also been recently 
proposed 14, 15, 16 . Although useful in some contexts, these 
methods do not clearly identify hierarchical levels and thus fail 
to satisfy condition (ii) above. Furthermore, all these methods 
yield a tree even for networks with no internal structure. 

In the following, we define inclusion hierarchies in complex 
networks and describe an ensemble of hierarchically nested 
random graphs. We then introduce a method that is able to 
accurately extract the hierarchical organization of hierarchi- 
cal random graphs. Finally, we apply our method to several 
real- world networks. 
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1 We do not consider other hierarchical schemes that classify nodes according to, for instance, their 
importance 9 . Another issue that we do not address here is that of "overlapping" modules. In the 
literature, some authors refer to the existence of "soft" boundaries between communities I10II11I . 
However, there has been so far no rigorous connection between the soft boundaries and the overlap 
between communities. Moreover, at present, there is no theoretical model that includes overlapping 
modules, that is, modules that share nodes, as opposed to communities that share edges. 
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Inclusion hierarchies 

Consider the ensemble of networks comprised of N nodes, 
N = { rii : i = 1,...,N}, that hold membership in a set 



of nested groups, Q ={ g^ kl 



. k t ) 



£=1,2...}, where £ is the 



level at which the group is defined, and the labels k± . . . ke-± 
indicate the groups at higher levels in the hierarchy within 
which the group is nested. For instance, group jm is a group 
defined at £ — 3 that is nested inside group gn defined at 
£ — 2, which in turn is a subgroup of group gi defined at 
£ = 1. 

Let Qi C G be the set of groups in which node rn holds 
membership. Here, we consider that node holds member- 
ship in only one group per level, and that membership to 
groups follows a nested hierarchy. Therefore, for node m to 
hold membership in group gn, node n; must also hold mem- 
bership in group g\. 

We assume that the probability pij of the edge (ni,rij) 
being present in a network is a function solely of the set of 
co-memberships Mij = Gi n Qj of the two nodes. Note that 
our assumptions imply that: (i) Mij obeys transitivity, so 
that if Mij — Mik, then Mij = Mjkj and (ii) node member- 
ships in groups {gk 1 k 2 } a t the second level are uniquely and 
completely defined by the sub-network of connections of all 
nodes holding membership in group gk 1 , that is, information 
at deeper levels in the hierarchy is totally decoupled from the 
information at higher levels in the hierarchy. 

In the simplest scenario, pij is a non-decreasing function of 
the cardinality x of Mij, which implies that groups of nodes 
holding membership in the same groups will be more densely 
connected than a randomly selected group of nodes. This is 
precisely the underlying assumption in many algorithms aim- 
ing to detect the top level community structure of complex 
networks assuming a flat organization of the nodes [1711181119] . 

Let us now introduce an ensemble of random networks 
which are constructed following hierarchical node membership 
assignment: hierarchically nested random graphs. We restrict 
our ensemble to networks with a homogeneous hierarchical 
organization of the nodes (see Supplementary Information for 
other kinds of hierarchical organization) that have the same 
degree distribution as Erdos-Renyi graphs [20] . 

To illustrate the model, consider a network comprised 
of 640 nodes that hold membership in a set of groups G 
with a three-level homogeneous nested organization. We as- 
sign group memberships so that the number Si of nodes 
holding membership in each group for £ = 1,2, and 3 is 
Si = 160, 5a = 40, and S 3 = 10, respectively. For £ = 1, 
nodes can hold membership in one of four different groups 
{ gkx '■ ki = 1,...,4}. For 1 = 2, nodes holding member- 
ship in group gk 1 can hold membership in one of four groups 
{gk 1 k 2 '■ ^2 = 1, ...,4}. Finally, for £ = 3, nodes holding 
membership in groups gk ± and Pfc 1 fe 2 can hold membership in 
one of four groups { g^hskg : k 3 = 1, . . . ,4}. 

The probability p x of edge (rii , rij ) existing is a mono- 
tonically growing function that depends exclusively on the 
cardinality x of Mij. Thus, if the expected number of links 
between m and nodes {{n^} : ||A^ij|| = x} is k x = p x S x . 
Probabilities are chosen so that the average degree of a node 
is k = X^=(T k £ ' an d t ne ratio p = k < i/ki is constant through- 
reason for such 
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out the levels, where k < e = X^'^o ^ 

choice is to facilitate both the graphic representation and the 
interpretation of the results. Note that, for p < 1, deeper lev- 



els are more cohesive, whereas for p > 1, they are less cohesive 
(Supplementary Information). 

Extracting the hierarchical organization of networks 

Our method consists of two major steps (Fig. [I]): (i) measuring 
the "proximity" in the hierarchy between all pairs of nodes, 
which we call node affinity; and (ii) uncovering the overall hi- 
erarchical organization of node affinities, or, in other words, 
detecting the underlying organization of group memberships. 

Node affinity — A standard approach for quantifying the affin- 
ity between a pair of nodes in a network is to measure their 
"topological overlap" |12II21|[22] , which is defined as the ratio 
between the number of common neighbors of the two nodes 
and the minimum degree of the two nodes. This measure 
identifies affinity between nodes with a dense pattern of local 
connections. Because topological overlap is a local measure, it 
will fail to detect any structure when a network is not locally 
dense (Fig.[2|. 

We propose a new affinity measure based on surveying of 
the modularity landscape 23 , a collective property of the net- 
work. Our definition of affinity between nodes draws upon the 
idea that modules correspond to sets of nodes which are more 
strongly interconnected than one would expect from chance 
alone [231 124] . We show below that our affinity measure de- 
tects the modular structure even in the absence of a dense 
pattern of local connections. 

Consider the ensemble V of all partitions of a network into 
modules 23 , 25 , and assign to each partition P the modular- 
ity 

U ( di S ' 
Z ~ V 2L 



M(P) = J2 



1] 



where L is the total number of links in the network, U is the 
number of links within module i, di is the sum of degrees of 
all the nodes inside module i, and the sum is over all the m 
modules in partition P (Fig.[TJ\). The modularity of a parti- 
tion is high when the number of intra-module links is much 
larger than expected for a random partition. 

Let Pmax be the set of partitions for which the modularity 
M is a local maxima, that is, partitions for which neither the 
change of a single node from one module to another nor the 
merging or splitting of modules will yield a higher modular- 
ity [26] . Let B max = { b(P) : P € P m „ } be the sizes of the 
"basin of attraction" of those maxima. The affinity Aij of 
a pair of nodes is then the probability that when local 

maxima P £ 'Pmax are sampled with probabilities b(P), nodes 
are classified in the same module. 

Note that, in contrast to other affinity measures proposed 
in Refs. [Ill 1161 123] . the measure we propose does not neces- 
sarily coincide with the "optimal" division of nodes into mod- 
ules, that is, the partition that maximizes M [27]. In fact, the 
modules at the top level of the hierarchy do not necessarily 
correspond to the best partition found for the global network, 
even for relatively simple networks (Fig.|2p). 

Statistical significance of hierarchical organization — Given a 
set of elements and a matrix of affinities between them, a 
commonly used tool to cluster the elements and, presumably, 



For example, for the three-level network described earlier, and k — 16 and p — 1, feg = 
ki — 4, k 2 = 2, and k^ — 3 (see Supplementary Material for the expression of p x ). 
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uncover their hierarchical organization is hierarchical cluster- 
ing 28 , 29 . Hierarchical clustering methods have three major 
drawbacks: (i) They are only accurate at a local level — at ev- 
ery step a pair of units merge and some details of the affinity 
matrix are averaged with an inevitable loss of information; 
(ii) the output is always a hierarchical tree (or dendogram), 
regardless of whether the system is indeed hierarchically or- 
ganized or not; (iii) there is no statistically sound general 
criterion to determine the relevant levels on the hierarchy. 

In order to overcome the first caveat of agglomerative 
methods such as hierarchical clustering, one necessarily has 
to follow a top to bottom approach that keeps the details 
of the matrix. That is the spirit of divisive methods such 
as k- means or principal component analysis [28], which group 
nodes into "clusters" given an affinity matrix. However, these 
methods have a significant limitation: the number of clusters 
is an external parameter, and, again, there is no sound and 
general criterion to objectively determine the correct number 
of clusters. 

Because of the caveats of current agglomerative and di- 
visive methods, we propose a "box-clustering" method that 
iteratively identifies in an unsupervised manner the modules 
at each level in the hierarchy. Starting from the top level, each 
iteration corresponds to a different hierarchical level (Fig.|2|. 

In order to assess whether the network under analysis has 
an internal organization we need to compare with the ap- 
propriate null model, which in this case is an ensemble of 
"equivalent" networks with no internal organization. These 
equivalent networks must have the same number of nodes and 
an identical degree sequence. A standard method for gen- 
erating such networks is to use the Markov-chain switching 
algorithm 30, 31 . Despite their having no internal structure, 
these networks have numerous partitions with non-zero mod- 
ularity [25]. Thus, to quantify the level of organization of a 
network, one needs to compare the modularities of the sam- 
pled maxima for the original network and its corresponding 
random ensemble; if the network has a non-random internal 
structure, then local maxima in the original landscape should 
have larger modularities than local maxima in the landscapes 
of the randomized networks. 

Specifically, for a given network, we compute the average 
modularity M av from { M(P) : P £ P max }. Then, we com- 
pute the same quantity M av for each network in the equivalent 
random ensemble. In virtue of the central limit theorem, the 
set of average modularities for the whole ensemble { Ml v } is 
normally distributed with mean M ran d and variance <7m 
To quantify the level of organization of a network, we thus 
compute the z-score of the average modularity 

M av ~ M r 

and r„i 



If z is larger than a threshold value zt, then the network has in- 
ternal structure and we proceed to identify the different mod- 
ules, otherwise we conclude that the network has no structure. 
In what follows, we show results for zt = 2.3267, which corre- 
sponds to a 1% significance level (Supplementary Material^] 
Building the hierarchical tree — In networks organized in a hi- 
erarchical fashion, nodes that belong to the same module at 
the bottom level of the hierarchy have greater affinity than 
nodes that are together at a higher level in the hierarchy. 
Thus, if a network has a hierarchical organization, one will be 
able to order the nodes in such a way that groups of nodes 
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with large affinity are close to each oder. With such an or- 
dering, the affinity matrix will then have a "nested" block- 
diagonal structure (Fig.Q. This is indeed what we find for 
networks belonging to the ensemble of hierarchically nested 
random graphs (Fig.|2|. 

For real-world networks, we do not know a priori which 
nodes are going to be co-classified together, that is, we do not 
know which is the ordering of the nodes for which the affinity 
matrix has a nested block-diagonal structure. To find such an 
ordering, we use simulated annealing [32] to minimize a cost 
function that weighs each matrix element with its distance to 
the diagonal |33] 

1 N 

where N is the order of the affinity matrix (see Fig.[TJ\ and 
Supplementary Information for alternative ordering schemes) . 

This problem belongs to the general class of quadratic as- 
signment problems [51]. Other particular cases of quadratic 
assignment problems have been suggested to uncover differ- 
ent features of similarity matrices [35] Our algorithm is able 
to find the proper ordering for the affinity matrix and to ac- 
curately reveal the structure of hierarchically nested random 
graphs (Fig.j2|. 



Unsupervised extraction of the structure — Given an 
ordered affinity matrix, the last step is to partition the nodes 
into modules at each relevant hierarchical level. An ansatz 
that follows naturally from the considerations in the previous 
section and the results in Fig. [2] is that, if a module at level £ 
(or the whole network at level 0) has internal modular struc- 
ture, the corresponding affinity matrix is block-diagonal: At 
level I, the matrix displays boxes along the diagonal, such that 
elements inside each box s have an affinity A\, while matrix 
elements outside the boxes have an affinity Bg < A\. Note 
that the number of boxes for each affinity matrix is not fixed; 
we determine the "best" set of boxes by least squares fitting 
of the block-diagonal model to the affinity matrix. 

Importantly, we want to balance the ability of the model 
to accurately describe the data with its parsimony, that is, we 
do not want to over-fit the data. Thus, we use the Bayesian 
information criterion in order to determine the best set of 
boxes [36] Q 

To find the modular organization of the nodes at the top 
level (level 1), we fit the block diagonal model to the global 
affinity matrix. As we said previously, we assume that the 
information at different levels in the hierarchy is decoupled, 
thus in order to detect sub-modules beyond the first level, one 
needs to break the network into the sub-networks defined by 
each module and apply the same procedure (Fig. [I]). The al- 
gorithm iterates these steps for each identified box until no 
sub-networks are found to have internal structure. 



' Results for real networks at a 5% significance level are identical, however, the more stringent 
threshold is more efficient at detecting the last level in the hierarchy for model networks. Only for 
a 1-3% of the cases — depending on the cohesiveness of the levels — do we find that algorithm finds 
one more level than expected. 

4 We have also applied Akaike's information criterion 1371 . obtaining the same results for most of 
the cases. 

PNAS | Issue Date | Volume | Issue Number | 3 



Method validation 

We validate our method on hierarchically nested random 
graphs with one, two, and three hierarchical levels. We de- 
fine the accuracy of the method as the mutual information 
between the empirical partition and the theoretical one |38j . 
Figure[2p shows that the algorithm uncovers the correct num- 
ber of levels in the hierarchy. 

Moreover, our method always detects the top level, even 
for the networks with three hierarchical levels. In contrast, 
because the partition that globally maximizes M corresponds 
to the sub-modules in the second level, even the more ac- 
curate module identification algorithms based on modularity 
maximization would fail to capture the top level organization 
(Joshi et al. 2007, [27]). 

The hierarchically nested random graphs considered above 
have a homogeneous hierarchical structure; however, real- 
world networks are not likely to be so regular. In particu- 
lar, for real-world networks one expects that some modules 
will have deeper hierarchical structures than others. We thus 
have verified that our method is also able to correctly un- 
cover the organization of model networks with heterogeneous 
hierarchical structures (Supplementary Information). 

Testing on real world networks 

Having validated our method, we next analyze different types 
of real-world networks for which we have some insight into 
the network structure: the world-wide air-transportation net- 
work |39l 1401 141j . an e-mail exchange network of a Catalan 
university [14], and an electronic circuit [6]. 

In the air transportation network, nodes correspond to 
airports and two nodes are connected if there is a non-stop 
flight connecting them. In the email network, nodes are peo- 
ple and two people are connected if they send emails to each 
other. In the electronic network, nodes are transistors and 
two transistors are connected if the output of one transistor 
is the input of the other (Table]!]). 

We find that the air-transportation network is strongly 
modular and has a deep hierarchical organization (Fig. [3]. 
This finding does not come as a surprise since historical, eco- 
nomic, political, and geographical constraints shape the topol- 
ogy of the network [391 1401 |4"T] . We find eight main modules 
that closely match major continents and sub-continenets, and 
major political divisions and thus truly represent the highest 
level of the hierarchjj^] 

The electronic circuit network is comprised of eight D- 
flipflops and 58 logic gates |Bj. Our method identifies two 
levels in the network (Fig.|4]A_). At the top level, modules are 
groups of logic gates, all the logic gates comprising a D-flipflop 
being in the same module. At the second level, the majority 
of modules comprise single gates. 



For the email network, five of the seven major modules 
at the top level (Fig.|4j3) correspond to schools in the univer- 
sity, with more than 70% of the nodes in each of those mod- 
ules affiliated to the corresponding school. The remaining two 
major modules at the top level are a mixture of schools and 
administration offices (often collocated on campus), which are 
distinctly separated at the second level. The second level also 
identifies major departments and groups within a school, as 
well as research centers closely related to a school. 

Application to metabolic networks 

Finally, we analyze the metabolic networks of E. coli obtained 
from two different source^] (Fig.JHJ: the KEGG database 
[441 145] . and the reconstruction compiled by Palsson's Sys- 
tems Biology Lab at UCSD [46]. In these networks, nodes are 
metabolites and two metabolites are connected if there is a 
reaction that transforms one into the other 47 . 

To quantify the plausability of our classification scheme, 
we analyze the within-module consistency of metabolite path- 
way classification for the top and the second levels of the 
metabolic network for E. coli reconstructed at UCSD [46] . 
For each module, we first identify the pathways represented; 
then, we compute the fraction of metabolites that are classi- 
fied in the most abundant pathway. We find that there is a 
clear correlation between modules and known pathways: At 
the top level, for all the modules except one, we find that 
the most abundant pathway comprises more than 50% of the 
metabolites in the module. 

For the second level, we find that for most of the modules 
all the metabolites are classified in the same pathway. We also 
detect smaller pathways that are not visible at the top level 
(such as those for polyketides and nonribosomal peptides, and 
for secondary metabolites). 

Our results thus provide an objective description of cel- 
lular metabolism that, while not affected by human subjec- 
tivity, captures our current understanding of these networks. 
Interestingly, "known" pathways do not correspond to a sin- 
gle module at the top level, implying that large pathways are 
in fact comprised of smaller units. Intriguingly, these units 
are not necessarily uniform in "pathway composition" but are 
a mixture of sub-modules associated to different pathways. 
Thus, an important question is how the modules we identify 
relate to metabolism evolution [48] . 
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Fig. 1. Schematic illustration of our method. Step 1: Affinity matrix. Sampling of the maxima of the of the modularity landscape. We use the co-classification of nodes 
in the same module for partitions that are a local maxima of the modularity landscape as a measure of the affinity between the nodes. We then verify whether the network 
has a non-random internal structure. If it does not, we stop here. Step 2a: Ordering the affinity matrix and extraction of the hierarchical organization. The affinity matrix will 
show a hierarchical organization of the nodes, if pairs of nodes with high affinities occupy contiguous rows in the matrix. To find the optimal ordering of the nodes, we define a 
"cost function" that weighs each matrix element by its distance to the diagonal. Step 2b: Extracting the hierarchical organization. The signature of a hierarchical organization 
i the existence of a nested block diagonal structure in the affinity matrix. In order to identify the different modules (boxes) at each level I in the hierarchy, we propose an 



ansatz matrix with 71 boxes of identical elements along the diagonal Af, for , 



0, 



, 71, and identical elements Bg outside the boxes. We use a "least-squares" method 



combined with a "greedy algorithm" to determine the partition that best fits the model (see text and Supplementary Information). We go back to step la for each one of the 
sub-networks defined by the partition. 



Network 


Size 


Modules 


Main modules 


Air transportation 


3618 


57 


8 


Email 


1133 


41 


8 


Electronic circuit 


516 


18 


11 


E. coli KEGG 


739 


39 


13 


E. coh UCSD 


507 


28 


17 



Table 1. Top-level structure of real-world networks. We show both the total number of modules and the number of main 
modules at the top level. Main modules are those comprised of more than 1% of the nodes. Note that there is no correlation 
between the size of the network and the number of modules. 
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Fig. 2. Affinity measures and clustering methods. We generate two model networks comprised of 640 nodes with average degree 16. A, Modular network with "flat" 
structure. The network comprises four modules with 160 nodes each. The nodes have an average of eleven within-module connections and five inter-module connections; 
B, Modular network with a three-level hierarchical structure. We show affinity matrices Aij obtained for two different measures: (i) topological overlap 12 ; (ii) "co- 
classification" (see text and Supplementary Information). The color scale goes from red for a probability of one to dark blue for a probability of zero. At the far right, we show 
the hierarchical tree obtained using two different methods: hierarchical clustering and the "box clustering" we propose. In the hierarchical clustering tree, the vertical axis 



shows the average distance, di 



1 — Aij, of the matrix elements that have already merged. In the box-model clustering tree, each row corresponds to one hierarchical 



level. Different colors indicate different modules at that level. To better identify which are the sub-modules at a lower level, we color the nodes in the sub-modules with 
shades of the color used for the modules in the level above. Note that topological overlap fails to find any modular structure beyond a locally dense connectivity pattern. In 
contrast, the co-classification measure clearly reveals the hierarchical organization of the network by the "nested-box" pattern along the diagonal. Significantly, the hierarchical 
tree obtained via hierarchical clustering fails to reproduce the clear three-level hierarchical structure that the affinity matrix displays, whereas the box-model clustering tree 
accurately reproduces the three-level hierarchical organization of the network. C, Accuracy of the method. We generate networks with 640 nodes and with built-in hierarchical 
structure comprising one (left), two (middle), and three (right) levels. The top level always comprises four modules of 160 nodes each. For networks with a second level, each 
of the top-level modules is organized into four sub-modules of 40 nodes. For the networks with three levels, each level-two module is further split into four sub-modules of ten 
nodes. We build networks with different degrees of level cohesiveness by tuning a single parameter p (see text). Since we know a priori which are the nodes that should 
be co-classified at each level, we measure the accuracy as the mutual information between the empirical partition of the nodes and the theoretical one |38|. We also plot the 
accuracy of a standard community detection algorithm 49 in finding the top level of the networks (dashed green line). We plot the mutual information versus p for networks 
with one (left), two (center) and three (right) hierarchical levels. Each point is the average over ten different realizations of the network. Full circles, empty squares, and full 
diamonds represent the accuracy at the top, middle, and lowest levels, respectively. 
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Fig. 3. Hierarchical organization of the air-transportation network. A, Global-level affinity matrix and hierarchical tree (the representation is the same used in Fig.[2j. B, 
Top-level modules. Each dot represents an airport and different colors represent different modules. Note that the top level in the hierarchy corresponds roughly to geo-political 
units. The "orange" module (comprised of the majority of European countries, ex-Soviet Union countries, Middle-Eastern countries, India, and countries in Northern half of 
Africa) splits for levels t = 2 (C) and t = 3 (D). 




Fig. 4. Hierarchical structure of technological and social networks. We show the ordered affinity matrices at the top level and the hierarchical trees that we obtain for A, 
the transistor implementation of an electronic circuit [6], and B, the email exchange network of a Catalan university [14] . Our method is capable of accurately uncovering the 
top level organization of the networks. For the transistor network, which is comprised of eight D-type flipflops and 58 logic gates, we find that at the top level, gates comprising 
a given D-flipflop are classified in the same module. A the second level, the majority of the modules are comprised of a single gate. For the email network, at the top level we 
find eight modules that closely match the organization of the schools and centers in the university |14| . 
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Fig. 5. Hierarchical structure of metabolic networks. Global level affinity matrices and hierarchical trees for the metabolic networks of E. coli obtained from: A, the KEGG 
database 44 45 , and B, the Systems Biology group at UCSD [46] . Note that the overall organization of the networks is similar and independent of the reconstruction used 
to build the network. C, For the metabolic network of E. coli from the Systems Biology group at UCSD, we analyze the within-module consistency of metabolite pathway 
classification for the first (top plot) and the second (bottom plot) levels. For each module, we first identify the pathway classifications of the corresponding metabolites; then, 
we compute the fraction of metabolites that are classified in the most abundant pathway. In the plots, each bar represents one module, its width being proportional to the 
number of nodes it contains. We color each bar according to its most abundant pathway following the color code on the right hand side. At the second level (bottom plot), 
we show each sub-module directly below its corresponding top level module. Again, the width of each sub-module is proportional to its size. Note that, at the first level (top), 
for all modules except one, the most abundant pathway is comprised of more than 50% of the metabolites in the module. Remarkably, at the second level (bottom), for most 
of the modules all the metabolites are classified in the same pathway. Moreover, at the second level, we detect smaller pathways that are not visible at the top level. 
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